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SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR ANALYZING AN 

MPEG-FORMATTED FILE 



TECHNICAL FIELD 

[0001] This invention relates to video and audio technologies and, more 
particularly, to a system, method, and computer-readable medium for analyzing a moving 
pictures expert group-formatted file. 

BACKGROUND 

[0002] The proliferation of the Internet has resulted in consumer demand for 
numerous bandwidth-intensive services. As market penetration of wideband consumer 
communications equipment, such as digital subscriber line technologies, cable modems and 
other high-bandwidth end-user equipment, increases, the demand for data-intensive services 
also increases. Downloading music and video files has already become commonplace for a 
large portion of Internet users. Moreover, the prevalence of consumer imaging technologies, 
such as digital cameras, digital camcorders and other digital media recorders, and personal 
computer (PC) video editing software applications has increased demand for data-intensive 
storage technologies. 

[0003] Advances in data storage technologies are continually being made. 
Although magnetic hard drives offer gigabytes of storage space, many users consume massive 
amounts of storage space in relatively short periods of time. Many consumers have turned to 
removable data storage media due to the relatively low cost and ease of use. Compact disc 
(CD) recorders, or "bumers, ,, are particularly attractive to consumers seeking low cost and 
high data storage capabilities. The introduction of re-writeable CD burners has further 
increased the convenience of compact disc media. Furthermore, the interchange capabilities 
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provided by writeable compact disc media further enhances the market acceptance of compact 
disc drives and media. Digital versatile disc (DVD) recorders utilizing recordable DVD and 
re-writeable DVD technologies also provide an attractive format for consumers to store large 
multimedia files generated by, for example, camcorders, PC video, and the like, because of 
the advanced features of DVD such as interactivity, direct access, large storage capacity, and 
nominal cost. 

[0004] Although writeable DVD drives are capable of writing data to DVDs that 
are compatible with most available DVD players for reading data threrefrom, e.g., DVD 
drives included in a personal computer system, a barrier remains for effectively harnessing 
the capabilities of writeable DVDs. Presently, specialized software, commonly referred to as 
video mastering software, is required to format video files, such as moving pictures expert 
group- 1 and -2 (MPEG-1 and MPEG-2) files and other computer video and audio file 
formats, into a form suitable for storage and playback by a DVD decoder. The DVD-video 
specification currently provides for 133 minutes per DVD data layer, or more, of high quality 
MPEG-2 encoded video with multi-channel surround sound. However, MPEG-2 standards 
are less stringent than DVD-video standards, and due to the abundance of variant MPEG 
coders, variations in MPEG-2 formatted files are common. Consequently, files may be 
suitably formatted for playback by an MPEG-2 decoder yet unsuitable for storage and 
playback by a DVD decoder. 



SUMMARY OF THE INVENTION 

[0005] In accordance with an embodiment of the present invention, a method of 
analyzing a MPEG-formatted file comprises defining a rule comprising at least one parameter 
that logically defines a format requirement. The method also comprises reading a portion of 
the file, comparing the portion of the file with the rule, and determining whether the file 
violates the rule. 

[0006] In accordance with another embodiment of the present invention, a system 
for analyzing a moving pictures expert group (MPEG)-formatted file comprises a format 
analysis application and a processing element operable to execute the application. The 
application is adapted to read a rule having at least one logical instruction defining a format 
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requirement and compare a portion of the file with the rule. A determination of validity of 
the file is made by the application dependent upon a result of the comparison. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0007] For a more complete understanding of the present invention, the objects 
and advantages thereof, reference is now made to the following descriptions taken in 
connection with the accompanying drawings in which: 

[0008] FIGURE 1 is a diagram illustrating a portion of a moving pictures expert 
group frame sequence for use with an embodiment of the present invention; 

[0009] FIGURE 2 is a diagram illustrating a sequence header that may be 
followed by a group of pictures header in a video/audio object for use with an embodiment of 
the present invention; 

[0010] FIGURE 3 is a block diagram illustrating an embodiment of a moving 
pictures expert group multimedia object analysis system in accordance with the present 
invention; 

[0011] FIGURE 4 is a diagram illustrating a software application for performing 
analysis of file content and format as may be implemented according to an embodiment of the 
present invention; 

[0012] FIGURE 5 is a diagram illustrating one example rule that may be included 
within a rule set and stored in a database retrievable by the application described with 
reference to FIGURE 4, according to an embodiment of the present invention; and 

[0013] FIGURE 6 is a flowchart illustrating an embodiment of a moving pictures 
expert group multimedia object analysis method in accordance with the present invention. 

DETAILED DESCRIPTION OF THE DRAWINGS 

[0014] The preferred embodiment of the present invention and its advantages are 
best understood by referring to FIGURES 1-6 of the drawings, like numerals being used for 
like and corresponding parts of the various drawings. 

[0015] MPEG-2 is most often the compression choice for DVD-video although 
MPEG-1 compressed video may be used as well. MPEG-2 provides superior video quality 
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and is used for such applications as digital television and DVD-video and is defined in 
international organization for standardization (ISO) 13818. Briefly, MPEG video comprises 
sequences of video frames, or pictures, subjected to compression. In general, each video 
frame is divided into macroblocks each comprising four blocks of luminance (Y) and one 
block each of U and V color. Each macroblock is compressed using well-known discrete 
cosine transform (DCT) encoding and motion compensation. 

[0016] Each MPEG video frame may be any one of three general types: an intra- 
coded (I) frame, a predicted (P) frame, or a bi-directional (B) frame. I frames are encoded 
using DCT encoding and are compressed without reference to any other frame. Typically, I 
frames are included in a frame sequence every 12 to 18 frames. Decoding of a video 
sequence starts at an I frame and, consequently, fast forward, reverse, and direct access are 
performed by decoding an I frame. A P frame is coded as a difference from a previous I 
frame or, alternatively, another previous P frame. B frames are encoded as differences from 
previous or subsequent frames in a frame sequence. I, P, and B frames are interleaved in a 
frame sequence with playback always initiated from an I frame. 

[0017] With reference to FIGURE 1, there is illustrated a portion of an example 
MPEG frame sequence 5 OA comprising a sequence of interleaved I frames 51 A and 5 IB, B 
frames 52A-52E, and P frames 53A-53C as may be stored in a video/audio (V/A) or 
multimedia object on a DVD. As shown, each B frame is encoded as a difference from a 
previous or subsequent P frame (AP) or I frame (AI). Each P frame is encoded as a difference 
from the most recent I frame or P frame. For example, P frame 53 A is encoded as a 
difference from previous I frame 51 A while P frame 53B is encoded as a difference from 
previous P frame 53A. The order in which individual frames of sequence 50A are displayed 
during playback will vary from the order in which the frames are stored as is necessary for 
decoding of B frames. 

[0018] A V/A or multimedia object comprises frame sequence 50A. In addition 
to frames 51A-51B, 52A-52E, and 53A-53C, a video sequence 50 of the V/A object has a 
sequence header 70 followed by a group of pictures (GOP) header 60, as shown in the 
simplified schematic of V/A or multimedia object 22A of FIGURE 2. Additionally, an end 
code 80 is appended to video sequence 50. While individual frames 51A-51B, 52A-52E, and 
53A-53C are illustrated as contiguously adjacent in frame sequence 50A, various headers may 
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be inserted between two otherwise adjacent frames. For example, a repeat sequence header 
and/or a GOP header 60 may be inserted between two adjacent frames. 

[0019] Sequence header 70 comprises a 32-bit sequence header code followed by 
various elements that define respective parameters or characteristics of frame sequence 5 OA. 
For example, 12-bit horizontal size value and vertical size value elements respectively define 
the width and height of the displayable portion of a luminance component of encoded frames. 
A 4-bit frame rate code and an 18-bit bit-rate value element respectively define a display 
frame rate and a stream bit rate in bits-per-second to which frame sequence 50A should be 
processed. Sequence header 70 may comprise other data as well, including, but not limited 
to, aspect ratio information and buffer size. 

[0020] GOP header 60 comprises a 32-bit group start code, a 25-bit time code, a 
1-bit closed GOP flag and a 1-bit broken link flag. The group start code is a predefined bit 
string (000001B8) that marks the beginning of a group of pictures header. The 25-bit time 
code comprises various parameters defined by one or more of the 25 bits including, but not 
limited to, a drop frame flag, a time code hours, a time code minutes, time code seconds and 
time code pictures. The closed GOP flag indicates the type of prediction used for adjacent B 
frames following the initial I frame subsequent GOP header 60. The broken link flag is used 
to avoid display of frames that cannot be correctly decoded. 

[0021] FIGURE 3 is a diagram illustrating a MPEG multimedia object analysis 
system 100 in accordance with the present invention. In the illustrated embodiment, a file 
analysis application 122 is stored in a memory unit 40 for analyzing MPEG- formatted data 
according to embodiments of the invention. Application 122 is executed by an operating 
system 120 and one or more processing element(s) 130, such as a central processing unit. 
Operating system 120 controls the resources of system 100 and interfaces the instructions of 
application 122 with processing element 130 to enable application 122 to properly run. 

[0022] As shown, processing element 130 communicates to and drives the other 
elements within system 100 via a local interface 160, which may comprise one or more buses. 
Furthermore, an input device 140, for example a keyboard or a mouse, is used to input data 
from a user of system 100, and an output device 150, for example a display device or a 
printer, is used to output data to the user. A disk storage device 1 70, such as a magnetic disk, 
is connected to local interface 160 for data transfers therewith. A writeable DVD drive 180 is 
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interconnected with interface 160 or one of various peripheral interfaces 190 such as a 
peripheral computer interconnect (PCI) interface, a universal serial bus, or another suitable 
interface. 

[0023] In the embodiment illustrated in FIGURE 3, an MPEG file 175, for 
example an MPEG-2 formatted file, is stored in disk storage device 170. Processing element 
130 retrieves file 175 from device 170 and loads file 175 into memory unit 40 for analysis by 
application 122. Application 122 comprises and/or accesses at least one rule set 123 that 
defines parameters against which particular portions of file 175 may be compared for 
compatibility with DVD video and/or audio format standards for recording to, and playback 
from, a DVD. 

[0024] Prior to the present invention, a particular problem encountered when 
recording of an MPEG file on a writeable DVD is associated with GOP header 60. MPEG-1 
and MPEG-2 formatted video specifications enable all frames of video sequence 50A to be 
encoded as key, or I-frames; inclusion of GOP header 60 in an MPEG encoded multimedia 
file is optional. However, DVD decoders require the inclusion of GOP header 60 for proper 
playback. Thus, MPEG-formatted file 175 may be suitably encoded for playback by an 
MPEG decoder but be unsuitable for playback by a DVD device. Often a user attempting to 
create a DVD with an MPEG-formatted file has no direct knowledge of the source encoder 
used for creation of the MPEG file. In such a situation, the user may process the MPEG file 
with a DVD encoder, or mastering software, and write the file to a DVD as part of DVD file 
structure 5. Not until attempted playback with a DVD decoder does the user become aware 
of an incompatibility between the MPEG file and the required DVD format. In the event the 
source MPEG file was encoded without GOPs, the DVD player will be unable to process the 
DVD file for playback. In addition to inconveniencing the user, the DVD to which the 
MPEG file was written is ruined (in the case that a write-once DVD is used) or requires 
erasing (in the case that a re-writeable DVD is used). 

[0025] Embodiments of the present invention facilitate detection of incompatibly 
or noncompliantly encoded MPEG files by analyzing at least one portion of the MPEG- 
formatted file for discrepancies with DVD video and/or audio format requirements. For 
example, in some embodiments, application 122 reads a V/A object to determine the location 
of sequence header 70. The data content of the V/A object intermediate sequence header 70 
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and frame sequence 50A is analyzed, for example subjected to a pattern recognition 
algorithm, and evaluated for the presence of GOP header 60. In the event a GOP header is 
not located in the MPEG portion intermediate sequence header 60 and frame sequence 5 OA, 
application 122 determines that the MPEG- formatted file is incompatible for recording on a 
DVD. Application 122 may provide a notification, for example display of a message on 
output device 150, to inform the user accordingly and/or transcode the MPEG file, that is 
decode the MPEG data and subsequently re-encode the decoded data according to the MPEG 
rule set. 

[0026] MPEG specifications define header codes for various fields of a data 
stream. TABLE A summarizes various MPEG start codes that may be evaluated for 
compatibility with DVD file structure standards. In one embodiment, application 122 
identifies one or more portions of MPEG file 175 for analysis by identification of particular 
start codes. For example, application 122 determines the location of sequence header 70 by 
recognition of the sequence header start code and scans the portion of the MPEG data stream 
subsequent the sequence header start code until either sequence end code 80 or a group of 
pictures start code is encountered. In the event sequence end code 80 is encountered without 
detection of a group of pictures start code, application 122 determines that the MPEG file 175 
is unsuitably encoded for playback from a DVD decoder and preferably provides an 
indication thereof to the user. 



TABLE A 



Start Code 


Designated Field 


0x00 


Picture 


OxOl-OxAF 


Slice 


OxBO 


Reserved 


OxBl 


Reserved 


0xB2 


User data 


0xB3 


Sequence header 


0xB4 


Sequence error 


0xB5 


Extension 


0xB6 


Reserved 


0xB7 


Sequence end 


0xB8 


Group of Pictures 



[0027] In another embodiment of the present invention, application 122 evaluates 
an MPEG-file 1 75 for encoding characteristics that violate not only DVD specifications, but 
MPEG specifications as well. For example, ISO 13818 specifies that MPEG-2 video has a 
maximum allowable data rate of 9.8 Mb/s. Each MPEG sequence header 70 includes an 18- 
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bit field that specifies the sequence bit rate. Application 122 reads the bit rate field value and 
determines whether the MPEG file 175 is encoded in excess of the allowable encoding rate. 
Preferably, application 122 determines the bit rate of a frame sequence by reading a portion of 
MPEG file 175 by addressing the sequence header (0xB3) and an appropriate offset. The bit 
rate field is allocated the 8 th and 9 th bytes of the sequence header 70 and bits 7 and 6 of 
sequence header 70 byte 10. 

[0028] In some embodiments, application 122 first evaluates an MPEG rule set 
comprised of one or more rules logically defining one or more characteristics of the MPEG 
standard, e.g. ISO 13818. For example, the MPEG rule set may comprise a bit rate rule that 
defines a maximum allowable bit rate for an acceptably formatted MPEG file. Violation of 
any rule of the MPEG rule set may result in notice thereof provided as an output display 
message to the user and/or transcoding of the MPEG file. An evaluation of the encoded bit 
rate of an MPEG video stream is exemplary only and a rule set evaluated according to 
embodiments of the present invention may comprise tens, or even hundreds, of individual 
rules. ; 

[0029] In some embodiments, after evaluation of file 175 for violation of any rules 
of the MPEG rule set, MPEG file 175 is compared against a compatibility rule set comprised 
of one or more rules logically defining DVD format requirements. For example, the 
compatibility rule set may include a rule comprising one or more parameters logically 
defining the inclusion of GOP header 60 in MPEG file 175 for compatibility with DVD video 
decoder requirements. A determination that GOP header 60 does not exist in file 175 results 
in a compatibility violation and a notice thereof is preferably provided to the user by 
application 122. Other rules of the compatibility rule set test for other DVD-format 
incompatibilities. In the embodiments described above, application 122 first evaluates an 
MPEG rule set comprised of one or more rules logically defining one or more characteristics 
of the MPEG standard to evaluate MPEG file 175 and then compares MPEG file 175 with a 
compatibility rule set comprised of one or more rules logically defining DVD format 
requirements.. However, it should be understood that the order of rule set analysis may be 
otherwise varied. 

[0030] Preferably, application 122 is implemented in software and can be adapted 
to run on different platforms and operating systems. In particular, logical functions 
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implemented by application 122 may be provided as an ordered listing of executable 
instructions that can be embodied in any computer-readable medium for use by or in 
connection with an instruction execution system, apparatus, or device, such as a computer- 
based system, processor-containing system, or other system that can fetch the instructions 
from the instruction execution system, apparatus, or device, and execute the instructions. In 
the context of this document, a "computer-readable medium" can be any means that can 
contain, store, communicate, propagate or transport the program for use by or in connection 
with the instruction execution system, apparatus, or device. The computer-readable medium 
can be, for example, but is not limited to, an electronic, magnetic, optical, electro-magnetic, 
infrared, or semi-conductor system, apparatus, device, or propagation medium. 

[0031] FIGURE 4 is a schematic of application 122 as may be implemented in an 
embodiment of the present invention. Application 122 is executed by O/S 120 and interfaces 
or includes one or more rule sets 123. In the illustrative example, rule sets 123 comprise an 
MPEG rule set 123 A and a compatibility rule set 123B, each respectively comprising one or 
more rules 123Ai-123A N and 123Bi-123B N . Rule sets 123 are preferably maintained in a 
database stored in storage device 170 and retrievable by application 122. For example, rule 
sets 123 may be maintained in a table comprising records, each respectively defining a rule 
123A]-123B N . Rules 123Ai-123B N may comprise respective routines and/or subroutines 
executable by application 122, functions, or operands that parametrically define an MPEG or 
DVD format requirement. For example, rules 123Ai-123B N may comprise one or more 
parameters that logically define a format requirement that is processed by application 122. 

[0032] FIGURE 5 is a schematic of an example rule 123Ai that may be included 
within MPEG rule set 123 A retrievable by application 122, according to an embodiment of 
the present invention. Rule 123Ai comprises fields 126, each containing one or more 
parameters for analysis by application 122. In the present example, rule 123Ai comprises 
logical parameters defining a pre-determined maximum bit rate for MPEG-formatted video. 
In the illustrative example, a first field 126i identifies a start code, e.g., 0xB3, that designates 
the beginning of sequence header 70. Application 122 scans a portion of file 175 until 
recognition of the start code identified by field 126i is made. Field 126 2 specifies an offset to 
be addressed and read from the address of the associated start code. The address offset 
8:10(6) in this example specifies that bytes 8 and 9 and the two most significant bits of byte 
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10 - the allocated address space of an MPEG-2 sequence header for the bit rate field - are to 
be read by application 122 and compared with a maximum value (Max_value) allowed. 
Application 122 may perform a Boolean comparison between the bit field value read from the 
specified address designated by fields 126i and 126 2 and the maximum value specified by 
field 126 3 , and generate a valid or invalid designation therefrom. In a similar manner, other 
rules of rule set 123 A are evaluated and compatibility rule set 123B as well. Preferably, any 
evaluation of a format or incompatibility violation is appended to an error log that is output 
upon processing of all rules 132Ai-123B N . 

[0033] FIGURE 6 is a flowchart 300 of application 122 processing according to 
an embodiment of the present invention. Application 122 processing is initiated by retrieving 
MPEG-formatted file 175 for analysis (block 305). Upon retrieval of file 175, an index i, or 
counter, is initialized in the event a plurality of rule sets are to be tested. A rule seti is then 
retrieved by application 122 and a rule index j is initialized (block 310). Rule seti is read by 
application 122 and a portion of MPEG file 175 specified for testing by rulej is read (block 
320). An evaluation of whether the addressed portion of MPEG file 175 is valid is made 
(block 325) according to the indexed rule,. In the event that the tested portion of file 175 does 
not violate rulej, the rule index j is incremented (block 340). Determination of a format 
violation results in application 122 appending an error message to an error log (block 335) 
and incrementing of the rule index (block 330). The rule index j is evaluated to determine if 
any rules remain in the rule set; being tested (block 340). If additional rules remain for 
evaluation of file 175, application 122 returns to processing of the currently indexed rulej 
(block 315). The rule set index i is incremented (block 345) upon completing evaluation of 
all rules of the currently indexed rule seti, and application 122 evaluates whether any 
remaining rule sets remain for testing (block 350). If another rule seti remains for testing, the 
rule index j is re-initialized and the rule seti is retrieved for testing (block 310) and processing 
of the rules of the newly indexed rule set commences. Application 122 processing completes 
by output of an error log after all rules of each rule set (block 355) have been evaluated. 

[0034] It should be understood that in the method described in FIGURE 7, certain 
functions may be omitted, combined, or accomplished in a sequence different than depicted in 
FIGURE 7. Also, it should be understood that the method depicted in FIGURE 7 may be 
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altered to encompass any of the other features or aspects described elsewhere in the 
specification. 



